A Blanket Binarization Method for Character String Extraction
نویسندگان
چکیده
In this paper, a binarization method based on fractal dimension for character string extraction is proposed. In character extraction from a scene image, a major problem is how to deal with much different type of characters in a complex background. The proposed method can obtain multiple threshold values which are correspond to each character regions by detecting the stable intervals of fractal dimension FD. The stable interval is a relatively low and flat valley of the FD which indicates the binarized image has the stable connected regions, and therefore fine character regions have been appeared. The character regions may contain some noise and has conflictions between the regions derived with another threshold values. We call these character region as a ”Candidate Character Region Images”(CCRI), and will be processed by noise-reduction consists of two steps. After that, CCRI are integrated into one binarized image as output image through the contention resolution process. We show the performance of the proposed method by comparing Niblack’s method as a local method and Otsu’s method as a global method on the dataset provided at ICDAR 2003.
منابع مشابه
A Fast Algorithm for Korean Text Extraction and Segmentation from Subway Signboard Images Utilizing Smartphone Sensors
We present a fast algorithm for Korean text extraction and segmentation from subway signboards using smart phone sensors in order to minimize computational time and memory usage. The algorithm can be used as preprocessing steps for optical character recognition (OCR): binarization, text location, and segmentation. An image of a signboard captured by smart phone camera while holding smart phone ...
متن کاملAn Effective Edge and Texture Based Approach towards Curved Videotext Detection and Extraction
In present day video text greatly helps video indexing and retrieval system as they often carry significant semantic information. Video text analysis is challenging due to varying background, multiple orientations and low contrast between text and non-text regions. Proposed approach explores a new framework for curved video text detection and recognition where from the observation that curve te...
متن کاملUsing Irregular Pyramid for Text Segmentation and Binarization of Gray Scale Image
Compared to binary images that most text extraction methods work on, gray scale images provides much more information for the extraction task. On the other hand complication also arises in determining the subject textual content from its background region (ie. thresholding) before the actual text extraction process can begin. Differing from the usual sequence of processes where document images ...
متن کاملUsing Irregular Pyramid for Text Segmentation and Binarization of Gray Scale Images
Compared to binary images that most text extraction methods work on, gray scale images provide much more information for the extraction task. On the other hand complication also arises in determining the subject textual content from its background region (ie. thresholding) before the actual text extraction process can begin. Differing from the usual sequence of processes where document images a...
متن کاملText Extraction and Text Binarization Algorithms
In the conventional method, grey image binarization processing with a given threshold is employed to extract high intensity video character regions. A corner based approach to detect text and caption from videos is presented in [47]. This approach is inspired by the observation that there exist dense and orderly presences of corner points in characters, especially in text and caption. The usage...
متن کامل